GENERALIZED WASSERSTEIN DISTANCE 
AND WEAK CONVERGENCE OF SUBLINEAR EXPECTATIONS 

XINPENG LI AND YIQING LIN 


Abstract. In this paper, we define the generalized Wasserstein distance for sets of 
Borel probability measures and demonstrate that the weak convergence of sublinear 
expectations can be characterized by means of this distance. 


1. Introduction 

In classical probability theory, the Wasserstein distance of order $p$ between two Borel probability measures $\mu$ and $\nu$ on the Polish metric space $(\Omega, d)$ is defined by

$$W_p(\mu,\nu) = \inf\left\{ E[d(X,Y)^p]^{1/p} : \mathrm{law}(X) = \mu,\ \mathrm{law}(Y) = \nu \right\},$$

which is related to optimal transport problems and can also be used to characterize the weak convergence of probability measures in the Wasserstein space $P_p(\Omega)$ (see Definition 6.4 in Villani [10]): roughly speaking, under some moment condition, the weak convergence of $\mu_k$ to $\mu$ is equivalent to $W_p(\mu_k,\mu) \to 0$. In particular, $W_1$ is commonly called the Kantorovich-Rubinstein distance and the following duality formula is well known:

$$W_1(\mu,\nu) = \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_\mu[\varphi] - E_\nu[\varphi]\}, \tag{1.1}$$

where $\Phi := \{\varphi : \|\varphi\|_{\mathrm{Lip}} \le 1\}$ is the collection of all Lipschitz functions on $(\Omega,d)$ whose Lipschitz constants are at most 1 (see Villani [10]).
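As an illustration of (1.1), the following minimal sketch computes $W_1$ for two finitely supported measures on the real line and compares it with the dual gap of one particular 1-Lipschitz function. It is not part of the paper: it assumes $\Omega = \mathbb{R}$ with $d(x,y) = |x-y|$, relies on the classical one-dimensional identity $W_1(\mu,\nu) = \int |F_\mu - F_\nu|\,dx$, and the names `wasserstein1_1d` and `dual_gap` are hypothetical.

```python
def wasserstein1_1d(xs, ps, ys, qs):
    """W_1 between mu = sum_i ps[i]*delta_{xs[i]} and nu = sum_j qs[j]*delta_{ys[j]} on R.

    Assumption: uses the one-dimensional identity W_1 = integral of |F_mu - F_nu|.
    """
    mass_mu, mass_nu = {}, {}
    for x, p in zip(xs, ps):
        mass_mu[x] = mass_mu.get(x, 0.0) + p
    for y, q in zip(ys, qs):
        mass_nu[y] = mass_nu.get(y, 0.0) + q
    points = sorted(set(mass_mu) | set(mass_nu))
    total, cdf_mu, cdf_nu = 0.0, 0.0, 0.0
    # Between consecutive atoms both distribution functions are constant.
    for left, right in zip(points, points[1:]):
        cdf_mu += mass_mu.get(left, 0.0)
        cdf_nu += mass_nu.get(left, 0.0)
        total += abs(cdf_mu - cdf_nu) * (right - left)
    return total


def dual_gap(xs, ps, ys, qs, phi):
    """E_mu[phi] - E_nu[phi]; by (1.1) this is at most W_1 when phi is 1-Lipschitz."""
    e_mu = sum(p * phi(x) for x, p in zip(xs, ps))
    e_nu = sum(q * phi(y) for y, q in zip(ys, qs))
    return e_mu - e_nu


mu = ([0.0, 2.0], [0.5, 0.5])   # atoms and weights of mu
nu = ([1.0, 3.0], [0.5, 0.5])   # nu is mu shifted by 1
print(wasserstein1_1d(*mu, *nu))
print(dual_gap(*nu, *mu, phi=lambda x: x))
```

In this example the identity function happens to attain the supremum in (1.1), because $\nu$ is a translate of $\mu$; for other pairs of measures a single test function only gives a lower bound.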

Recently, Peng established a theory of time-consistent sublinear expectations in [5, 6], i.e., the G-expectation theory. This new theory provides several important models and tools for studying robust hedging and utility maximization problems in financial markets under Knightian uncertainty (see, e.g., Epstein and Ji [2] and Vorbrink [11]). According to Denis et al. [1], the G-expectation can be expressed as an upper expectation in the sense of Huber [3], i.e., $\mathbb{E}_G[\cdot] := \sup_{P\in\mathcal{P}} E_P[\cdot]$, where $\mathcal{P}$ is a weakly compact set of martingale measures. Compared with the sublinear functionals studied by Lebedev in [4], the G-expectation is non-dominated due to the mutual singularity of elements in $\mathcal{P}$.

In this paper, the classical notion of distance between two probability measures is extended: more precisely, we define a generalized Wasserstein metric for two weakly compact sets of probability measures, which can be regarded as generators of (not necessarily dominated) sublinear expectations. This metric is given by a Hausdorff-type distance function, whose value is the “greatest” of all Wasserstein distances from a probability measure in one set to the “closest” probability measure in the other set. Furthermore, we adopt the notion of weak convergence for sublinear expectations from Peng [7] and then characterize this type of convergence by the generalized Wasserstein metric. We note that the main techniques in this paper could be applied to transport inequalities in the sublinear expectation context, for example, the transport-entropy inequality and the related large deviation principle. Also, the results may provide some useful tools for the further study of robust optimal transport problems on sublinear expectation spaces.

This paper is organized as follows: in the next section, we recall some notation in the framework of sublinear expectations and define the generalized Wasserstein distance. In Section 3, the duality formula for the generalized Kantorovich-Rubinstein distance is given and the characterization of the weak convergence of sublinear expectations is discussed.

2. Preliminaries 

We first recall preliminaries in the framework of sublinear expectations. Let $(\Omega, d)$ be a Polish space equipped with the Borel $\sigma$-algebra $\mathcal{B}(\Omega)$. In the sequel, we only consider the Borel probability measures on this space and denote by $\mathfrak{P}(\Omega)$ the collection of all sets of these measures. For each $\mathcal{P} \in \mathfrak{P}(\Omega)$, one can define a sublinear expectation $\mathbb{E}^{\mathcal{P}}[\cdot]$ in the following form:

$$\mathbb{E}^{\mathcal{P}}[\varphi] := \sup_{\mu\in\mathcal{P}} E_\mu[\varphi], \quad \forall \varphi \in L^0(\Omega), \tag{2.1}$$

where $L^0(\Omega)$ is the space of all $\mathcal{B}(\Omega)$-measurable real functions.

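It is easy to check that $\mathbb{E}^{\mathcal{P}}[\cdot]$ satisfies the following properties: for $X, Y \in L^0(\Omega)$,

(1) Monotonicity: if $X \ge Y$, then $\mathbb{E}^{\mathcal{P}}[X] \ge \mathbb{E}^{\mathcal{P}}[Y]$;

(2) Constant preserving: $\mathbb{E}^{\mathcal{P}}[c] = c$, $\forall c \in \mathbb{R}$;

(3) Sub-additivity: $\mathbb{E}^{\mathcal{P}}[X + Y] \le \mathbb{E}^{\mathcal{P}}[X] + \mathbb{E}^{\mathcal{P}}[Y]$;

(4) Positive homogeneity: $\mathbb{E}^{\mathcal{P}}[\lambda X] = \lambda \mathbb{E}^{\mathcal{P}}[X]$, $\forall \lambda \ge 0$,

where the usual rule $0 \cdot \infty = 0$ is applied in (4).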
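The properties above can be checked numerically on a toy example. The sketch below is only an illustration under simplifying assumptions that are not in the paper: the state space is the two-point set {0, 1}, the family of measures is finite, and the helper name `upper_expectation` is hypothetical.

```python
def upper_expectation(measures, f):
    """E^P[f] = max over mu in P of E_mu[f], for a finite family P.

    Each measure is a dict mapping a state to its probability (an assumption
    made for this sketch; the paper works on a general Polish space).
    """
    return max(sum(p * f(w) for w, p in mu.items()) for mu in measures)


# A finite family on the two-point space {0, 1}.
family = [{0: 1.0}, {1: 1.0}, {0: 0.5, 1: 0.5}]
X = lambda w: w
Y = lambda w: 1 - w

lhs = upper_expectation(family, lambda w: X(w) + Y(w))            # E^P[X + Y]
rhs = upper_expectation(family, X) + upper_expectation(family, Y)  # E^P[X] + E^P[Y]
print(lhs, rhs)
```

Here $X + Y \equiv 1$, so the left-hand side equals 1 while each of the two terms on the right equals 1: sub-additivity (3) can be strict, which is what distinguishes a sublinear expectation from a linear one.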

In Peng [5, 6], the properties of sublinear expectations are systematically studied and some limit theorems are proved. In particular, Peng [7] introduces the following notion of weak convergence for sublinear expectations:

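Definition 2.1. We say that a sequence $\{\mathbb{E}^{\mathcal{P}_n}[\cdot]\}_{n=1}^{\infty}$ of sublinear expectations converges weakly to some $\mathbb{E}^{\mathcal{P}}[\cdot]$ if, for each bounded and continuous function $\varphi \in C_b(\Omega)$,

$$\lim_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi].$$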

Now we define the generalized Wasserstein metric as follows: 

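Definition 2.2. Let $\mathcal{P}_1, \mathcal{P}_2 \in \mathfrak{P}(\Omega)$. For $p \ge 1$, we define the $p$-order Wasserstein metric between $\mathcal{P}_1$ and $\mathcal{P}_2$ in the following form:

$$\mathcal{W}_p(\mathcal{P}_1, \mathcal{P}_2) = \max\left\{ \sup_{\mu\in\mathcal{P}_1}\inf_{\nu\in\mathcal{P}_2} W_p(\mu,\nu),\ \sup_{\nu\in\mathcal{P}_2}\inf_{\mu\in\mathcal{P}_1} W_p(\mu,\nu) \right\},$$

where $W_p(\mu,\nu)$ is the classical $p$-order Wasserstein distance between two probability measures $\mu$ and $\nu$.

It is evident that the function $\mathcal{W}_p$ is ordered in $p$ like its classical counterpart $W_p$, namely, $\mathcal{W}_p \le \mathcal{W}_q$ if $1 \le p \le q$.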
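For finite families of finitely supported measures on the real line, the Hausdorff-type construction of Definition 2.2 can be evaluated directly. The following is a minimal sketch for $p = 1$ only, assuming $\Omega = \mathbb{R}$ with the usual distance; the function names `w1` and `generalized_w1` are hypothetical, and finite families are of course not convex, so this illustrates the definition rather than the setting of the main theorems.

```python
def w1(mu, nu):
    """Classical W_1 on R between finitely supported measures given as {atom: mass}.

    Assumption: one-dimensional identity W_1 = integral of |F_mu - F_nu|.
    """
    points = sorted(set(mu) | set(nu))
    total, cdf_mu, cdf_nu = 0.0, 0.0, 0.0
    for left, right in zip(points, points[1:]):
        cdf_mu += mu.get(left, 0.0)
        cdf_nu += nu.get(left, 0.0)
        total += abs(cdf_mu - cdf_nu) * (right - left)
    return total


def generalized_w1(family_1, family_2):
    """Hausdorff-type distance of Definition 2.2 with p = 1, for finite families."""
    # "Greatest" distance from a measure in family_1 to the "closest" one in family_2.
    one_sided_12 = max(min(w1(mu, nu) for nu in family_2) for mu in family_1)
    # The same with the roles of the two families exchanged.
    one_sided_21 = max(min(w1(mu, nu) for mu in family_1) for nu in family_2)
    return max(one_sided_12, one_sided_21)


family_1 = [{0.0: 1.0}, {1.0: 1.0}]   # {delta_0, delta_1}
family_2 = [{0.0: 1.0}]               # {delta_0}
print(generalized_w1(family_1, family_2))
```

In this example only one of the two one-sided terms is positive, which shows why the maximum of both is needed to obtain a symmetric quantity.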
Remark 2.3. It is easy to see that $\mathcal{W}_p$ is a semi-metric, and it is a metric if we only consider weakly compact elements in $\mathfrak{P}(\Omega)$.

Similarly to Definition 6.4 in Villani [10], we can define a space $\mathfrak{P}_p(\Omega)$ on which $\mathcal{W}_p$ defines a finite distance.

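Definition 2.4. (Wasserstein space) For $p \ge 1$, we denote by $\mathfrak{P}_p(\Omega)$ a subset of $\mathfrak{P}(\Omega)$, which is the collection of all $\mathcal{P}$ such that

(a) $\mathcal{P}$ is weakly compact and convex;

(b) For an arbitrary point $\omega_0 \in \Omega$,

$$\lim_{K\to\infty} \mathbb{E}^{\mathcal{P}}\left[d(\omega_0,\cdot)^p\, 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot)\right] = 0.$$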

Remark 2.5. 

(i) It is easy to verify that the space $\mathfrak{P}_p(\Omega)$ does not depend on the choice of $\omega_0$.

(ii) The assumptions for the space $\mathfrak{P}_p(\Omega)$ are crucial for our main result. In particular, assumptions (a) and (b) together enable us to apply Sion's result [8] and obtain a minimax theorem (see Theorem 2.7). Besides, the assumption of weak compactness in (a) on the set $\mathcal{P}$ ensures the “lower continuity” of the sublinear expectation $\mathbb{E}^{\mathcal{P}}[\cdot]$ for the indicator functions of closed sets or continuous functions, i.e., $\mathbb{E}^{\mathcal{P}}[\varphi_n] \downarrow \mathbb{E}^{\mathcal{P}}[\varphi]$ if $\varphi_n = 1_{F_n}$ and $\varphi = 1_F$ for closed sets $F_n \downarrow F$, or if $\varphi_n \in C_b(\Omega)$ and $\varphi_n \downarrow \varphi$ (see Lemmas 7 and 8 and Theorem 31 in Denis et al. [1]).

(iii) We remark that a typical sublinear expectation, the G-expectation defined in Peng [5, 6], is associated with such a set of probability measures, according to [1].

(iv) The sublinear expectation admits a unique representation on $\mathfrak{P}_p(\Omega)$, i.e., for $\mathcal{P}_1, \mathcal{P}_2 \in \mathfrak{P}_p(\Omega)$, if $\mathbb{E}^{\mathcal{P}_1}[\varphi] = \mathbb{E}^{\mathcal{P}_2}[\varphi]$, $\forall \varphi \in C_b(\Omega)$, then $\mathcal{P}_1 = \mathcal{P}_2$. The proof can be easily derived from Corollary 2.8.

The main technical tool in this paper is the minimax theorem in [8] as follows:

Theorem 2.6 (Corollary 3.3 in [8]). Let $M$ and $N$ be convex spaces, one of which is compact, and $f$ a function on $M \times N$, quasi-concave-convex and upper semi-continuous-lower semi-continuous. Then $\sup \inf f = \inf \sup f$.

In the rest of this paper, the following special case of the above theorem will be quoted, where $f$ is a linear and continuous function.

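Theorem 2.7. Let $\mathcal{P} \in \mathfrak{P}_1(\Omega)$. Fixing a singleton $\{\mu^*\} \in \mathfrak{P}_1(\Omega)$, we have

$$\inf_{\mu\in\mathcal{P}}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - E_\mu[\varphi]\} = \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \inf_{\mu\in\mathcal{P}} \{E_{\mu^*}[\varphi] - E_\mu[\varphi]\}.$$

Corollary 2.8. Let $\mathcal{P} \in \mathfrak{P}_1(\Omega)$. If there exists a singleton $\{\mu^*\} \in \mathfrak{P}_1(\Omega)$ such that

$$E_{\mu^*}[\varphi] \le \mathbb{E}^{\mathcal{P}}[\varphi], \quad \forall \varphi \in C_b(\Omega), \tag{2.2}$$

then $\mu^* \in \mathcal{P}$.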

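Proof. By (b) in Definition 2.4, we can use $\Phi := \{\varphi : \|\varphi\|_{\mathrm{Lip}} \le 1\}$ instead of $C_b(\Omega)$ in (2.2). Then we can rewrite (2.2) as

$$\sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi]\} \le 0,$$

where the value 0 is attained at $\varphi \equiv 0$. By Theorem 2.7, we have

$$\inf_{\mu\in\mathcal{P}}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - E_\mu[\varphi]\} = \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \inf_{\mu\in\mathcal{P}} \{E_{\mu^*}[\varphi] - E_\mu[\varphi]\} = 0.$$

Since $\mathcal{P}$ is weakly compact, we can choose $\bar\mu \in \mathcal{P}$ such that $\sup_{\varphi\in\Phi} \{E_{\mu^*}[\varphi] - E_{\bar\mu}[\varphi]\} = 0$, which implies that $\mu^* = \bar\mu \in \mathcal{P}$.

□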


3. Main results 


In this section, we first study the duality formula for the generalized Kantorovich-Rubinstein distance $\mathcal{W}_1$ on $\mathfrak{P}_1(\Omega)$ defined in the previous section.


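Theorem 3.1 (Duality formula for the generalized Kantorovich-Rubinstein distance). Let $\mathcal{P}_1, \mathcal{P}_2 \in \mathfrak{P}_1(\Omega)$. Then, we have

$$\mathcal{W}_1(\mathcal{P}_1, \mathcal{P}_2) = \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \left\{ \left| \mathbb{E}^{\mathcal{P}_1}[\varphi] - \mathbb{E}^{\mathcal{P}_2}[\varphi] \right| \right\}.$$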


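Proof. By the classical duality formula for the Kantorovich-Rubinstein distance and Theorem 2.7, we have

$$\begin{aligned}
\sup_{\mu\in\mathcal{P}_1}\inf_{\nu\in\mathcal{P}_2} W_1(\mu,\nu)
&= \sup_{\mu\in\mathcal{P}_1}\ \inf_{\nu\in\mathcal{P}_2}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&= \sup_{\mu\in\mathcal{P}_1}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \inf_{\nu\in\mathcal{P}_2} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \sup_{\mu\in\mathcal{P}_1}\ \inf_{\nu\in\mathcal{P}_2} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}_1}[\varphi] - \mathbb{E}^{\mathcal{P}_2}[\varphi]\}.
\end{aligned}$$

Similarly, we can obtain

$$\sup_{\nu\in\mathcal{P}_2}\inf_{\mu\in\mathcal{P}_1} W_1(\mu,\nu) = \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}_2}[\varphi] - \mathbb{E}^{\mathcal{P}_1}[\varphi]\}.$$

Therefore,

$$\begin{aligned}
\mathcal{W}_1(\mathcal{P}_1,\mathcal{P}_2)
&= \max\left\{ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}_1}[\varphi] - \mathbb{E}^{\mathcal{P}_2}[\varphi]\},\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}_2}[\varphi] - \mathbb{E}^{\mathcal{P}_1}[\varphi]\} \right\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \left\{ \left| \mathbb{E}^{\mathcal{P}_1}[\varphi] - \mathbb{E}^{\mathcal{P}_2}[\varphi] \right| \right\}.
\end{aligned}$$

□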
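The duality formula of Theorem 3.1 can be sanity-checked on the two-point space $\Omega = \{0, 1\}$ with $d(0,1) = 1$. The sketch below is an illustration under assumptions that are not in the paper: a probability measure is identified with its mass $t$ at the point 1, so that $W_1$ between two such measures is $|s - t|$, a convex weakly compact family is a closed interval of parameters, and a 1-Lipschitz function is normalised to $\varphi(0) = 0$, $\varphi(1) = c$ with $|c| \le 1$. All function names are hypothetical.

```python
def dist_to_interval(t, interval):
    """inf over s in [a, b] of |s - t|, i.e. inf of W_1 over the family."""
    a, b = interval
    return max(a - t, 0.0, t - b)


def primal(i1, i2):
    """Left-hand side of Theorem 3.1: the Hausdorff-type distance W_1(P_1, P_2).

    The distance to an interval is convex in t, so each one-sided supremum
    is attained at an endpoint of the other interval.
    """
    side_12 = max(dist_to_interval(t, i2) for t in i1)
    side_21 = max(dist_to_interval(t, i1) for t in i2)
    return max(side_12, side_21)


def upper_expectation(c, interval):
    """sup over t in [a, b] of E_t[phi] = c * t, where phi(0) = 0 and phi(1) = c."""
    a, b = interval
    return max(c * a, c * b)


def dual(i1, i2, grid=201):
    """Right-hand side of Theorem 3.1, with the slope c scanned over a grid of [-1, 1]."""
    slopes = [-1.0 + 2.0 * k / (grid - 1) for k in range(grid)]
    return max(abs(upper_expectation(c, i1) - upper_expectation(c, i2)) for c in slopes)


P1 = (0.2, 0.5)   # masses at the point 1 ranging over [0.2, 0.5]
P2 = (0.1, 0.9)   # masses at the point 1 ranging over [0.1, 0.9]
print(primal(P1, P2), dual(P1, P2))
```

Both sides agree up to floating-point rounding in this example; the supremum on the dual side is attained at the extreme slope $c = 1$, which lies on the grid.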


It is well known that Wasserstein distances metrize weak convergence in classical probability theory. Correspondingly, we can prove a similar theorem for sublinear expectations. Before presenting this result, we discuss some properties of the weak convergence of sublinear expectations.


Given a collection of probability measures $\mathcal{P}$, define

$$\overline{\mathcal{P}}(A) := \sup_{P\in\mathcal{P}} P(A), \qquad \underline{\mathcal{P}}(A) := \inf_{P\in\mathcal{P}} P(A).$$

Then, we have the following proposition: 

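Proposition 3.2. Let $p \ge 1$. Suppose that $\{\mathcal{P}_n\}_{n=1}^{\infty}$ and $\mathcal{P}$ are weakly compact sets of probability measures on $(\Omega, d)$. If $\{\mathbb{E}^{\mathcal{P}_n}[\cdot]\}_{n=1}^{\infty}$ converges weakly to $\mathbb{E}^{\mathcal{P}}[\cdot]$, then we have

(i) For each closed set $F$,

$$\limsup_{n\to\infty} \overline{\mathcal{P}}_n(F) \le \overline{\mathcal{P}}(F).$$

Or equivalently, for each open set $G$,

$$\liminf_{n\to\infty} \underline{\mathcal{P}}_n(G) \ge \underline{\mathcal{P}}(G).$$

(ii) The set $\mathcal{P}^* = \bigcup_{n=1}^{\infty} \mathcal{P}_n$ is tight.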

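Proof. (i) For a closed set $F$, there exists a sequence of bounded continuous functions $\{\psi_k\}_{k=1}^{\infty}$ such that $\psi_k \downarrow 1_F$; thus, from Theorem 31 in Denis et al. [1], $\mathbb{E}^{\mathcal{P}}[\psi_k] \downarrow \overline{\mathcal{P}}(F)$. On the other hand,

$$\limsup_{n\to\infty} \overline{\mathcal{P}}_n(F) \le \limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\psi_k] = \mathbb{E}^{\mathcal{P}}[\psi_k], \quad \forall k \in \mathbb{N},$$

which implies that (i) holds.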


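(ii) Since $\Omega$ is Polish, we can choose a subset $\{\omega_1, \cdots, \omega_n, \cdots\}$ dense in $\Omega$, and for a fixed $m \in \mathbb{N}$, we consider the cover $\{B(\omega_i, \tfrac{1}{m})\}_{i=1}^{\infty}$, where $B(\omega_i, \tfrac{1}{m}) := \{\omega : d(\omega_i, \omega) < \tfrac{1}{m}\}$.

Fixing $\varepsilon > 0$, we first prove that for each $m$, there exists $k_m$ such that

$$\underline{\mathcal{P}}_n\left( \bigcup_{i=1}^{k_m} B(\omega_i, \tfrac{1}{m}) \right) \ge 1 - \varepsilon/2^m, \quad \forall n \in \mathbb{N}. \tag{3.1}$$

Otherwise, there exists $m_0$ such that for each $k$, there exists $n_k$ with

$$\underline{\mathcal{P}}_{n_k}\left( \bigcup_{i=1}^{k} B(\omega_i, \tfrac{1}{m_0}) \right) < 1 - \varepsilon/2^{m_0}.$$

For each $n \in \mathbb{N}$, from the weak compactness of $\mathcal{P}_n$ and Lemma 8 in [1], we know

$$\underline{\mathcal{P}}_n\left( \bigcup_{i=1}^{k} B(\omega_i, \tfrac{1}{m_0}) \right) \uparrow 1, \quad \text{as } k \to \infty.$$

Thus, we can prove by contradiction that $\lim_{k\to\infty} n_k = \infty$. Then, by (i), $\forall j \in \mathbb{N}$,

$$\begin{aligned}
\underline{\mathcal{P}}\left( \bigcup_{i=1}^{j} B(\omega_i, \tfrac{1}{m_0}) \right)
&\le \liminf_{k\to\infty} \underline{\mathcal{P}}_{n_k}\left( \bigcup_{i=1}^{j} B(\omega_i, \tfrac{1}{m_0}) \right) \\
&\le \liminf_{k\to\infty} \underline{\mathcal{P}}_{n_k}\left( \bigcup_{i=1}^{k} B(\omega_i, \tfrac{1}{m_0}) \right) \le 1 - \varepsilon/2^{m_0},
\end{aligned}$$

which implies that

$$\overline{\mathcal{P}}\left( \Big( \bigcup_{i=1}^{j} B(\omega_i, \tfrac{1}{m_0}) \Big)^c \right) \ge \varepsilon/2^{m_0}, \quad \forall j \in \mathbb{N}.$$

This is a contradiction to the weak compactness of $\mathcal{P}$.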
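Choosing $k_m$ as in (3.1), we can verify that $K = \bigcap_{m=1}^{\infty} \bigcup_{i=1}^{k_m} \overline{B}(\omega_i, \tfrac{1}{m})$ is compact, where $\overline{B}(\omega_i, \tfrac{1}{m})$ denotes the closure of $B(\omega_i, \tfrac{1}{m})$. Then, for each $n$, we have

$$\begin{aligned}
\overline{\mathcal{P}}_n(K^c)
&= \overline{\mathcal{P}}_n\left( \bigcup_{m=1}^{\infty} \Big( \bigcup_{i=1}^{k_m} \overline{B}(\omega_i, \tfrac{1}{m}) \Big)^c \right) \\
&\le \sum_{m=1}^{\infty} \overline{\mathcal{P}}_n\left( \Big( \bigcup_{i=1}^{k_m} \overline{B}(\omega_i, \tfrac{1}{m}) \Big)^c \right) \\
&= \sum_{m=1}^{\infty} \left( 1 - \underline{\mathcal{P}}_n\left( \bigcup_{i=1}^{k_m} \overline{B}(\omega_i, \tfrac{1}{m}) \right) \right) \\
&\le \sum_{m=1}^{\infty} \varepsilon/2^m = \varepsilon.
\end{aligned}$$

Therefore, the set $\mathcal{P}^* = \bigcup_{n=1}^{\infty} \mathcal{P}_n$ is tight.

□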

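Proposition 3.3. Let $p \ge 1$. Suppose that $\{\mathcal{P}_n\}_{n=1}^{\infty}$ and $\mathcal{P}$ are elements in $\mathfrak{P}_p(\Omega)$. If $\{\mathcal{P}_n\}_{n=1}^{\infty}$ satisfies

$$\lim_{K\to\infty} \limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p\, 1_{\{\omega\in\Omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] = 0, \tag{3.2}$$

then the following statements are equivalent:

(i) $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$.

(ii) $\mathcal{W}_1(\mathcal{P}_n, \mathcal{P}) \to 0$.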

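Proof. We only need to prove that “(ii) ⇒ (i)”. Let $\hat d = \min(d, 1)$, and denote by $\hat W_p$ the classical Wasserstein distances defined with $\hat d$ and by $\hat{\mathcal{W}}_p$ the generalized one defined in terms of $\hat W_p$ (see Definition 2.2). We first show that

$$\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0 \iff \hat{\mathcal{W}}_p(\mathcal{P}_n, \mathcal{P}) \to 0,$$

where the necessity is directly deduced from $\mathcal{W}_p \ge \hat{\mathcal{W}}_p$.

To prove the sufficiency, we first see that the distance function $d$ is dominated by the sum of three parts, that is, for $K \ge 1$ and $\omega$, $\omega'$, $\omega_0 \in \Omega$,

$$d(\omega,\omega') \le d(\omega,\omega') \wedge K + 2 d(\omega,\omega_0) 1_{\{d(\omega,\omega_0) > K/2\}} + 2 d(\omega',\omega_0) 1_{\{d(\omega',\omega_0) > K/2\}}.$$

Then, there exists a constant $C_p$ only depending on $p$ such that

$$d(\omega,\omega')^p \le C_p \left( K^p \hat d(\omega,\omega')^p + d(\omega,\omega_0)^p 1_{\{d(\omega,\omega_0) > K/2\}} + d(\omega',\omega_0)^p 1_{\{d(\omega',\omega_0) > K/2\}} \right).$$

By the same argument as in the proof of Theorem 7.12 in Villani [9], we have for two probability measures $\mu$ and $\nu$,

$$\begin{aligned}
W_p(\mu,\nu)^p \le{}& C_p K^p \hat W_p(\mu,\nu)^p + C_p E_\mu\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right] \\
&+ C_p E_\nu\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right],
\end{aligned}$$

which implies that

$$\begin{aligned}
\mathcal{W}_p(\mathcal{P}_n,\mathcal{P})^p \le{}& C_p K^p \hat{\mathcal{W}}_p(\mathcal{P}_n,\mathcal{P})^p + C_p \mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right] \\
&+ C_p \mathbb{E}^{\mathcal{P}}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right].
\end{aligned}$$

First letting $n \to \infty$ and then letting $K \to \infty$, we have $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$.

Since the distance function $\hat d$ is bounded by 1, all the distances $\hat W_p$ are equivalent (see §7.1.2 in [9]), and so are the $\hat{\mathcal{W}}_p$. Therefore, we have

$$\mathcal{W}_p(\mathcal{P}_n,\mathcal{P}) \to 0 \iff \hat{\mathcal{W}}_p(\mathcal{P}_n,\mathcal{P}) \to 0 \iff \hat{\mathcal{W}}_1(\mathcal{P}_n,\mathcal{P}) \to 0 \iff \mathcal{W}_1(\mathcal{P}_n,\mathcal{P}) \to 0.$$

□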

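Theorem 3.4 (Wasserstein distance metrizes weak convergence). Let $p \ge 1$. Suppose $\{\mathcal{P}_n\}_{n=1}^{\infty}$ and $\mathcal{P}$ are elements in $\mathfrak{P}_p(\Omega)$. Then the following statements are equivalent:

(i) $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$.

(ii) For each $\varphi \in C(\Omega)$ with the growth condition: for some $\omega_0 \in \Omega$, $|\varphi(\omega)| \le C(1 + d(\omega_0,\omega)^p)$, $\forall \omega \in \Omega$, we have

$$\lim_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi].$$

(iii) $\{\mathbb{E}^{\mathcal{P}_n}[\cdot]\}_{n=1}^{\infty}$ converges weakly to $\mathbb{E}^{\mathcal{P}}[\cdot]$ and (3.2) holds.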

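Proof. First, we prove the equivalence of (ii) and (iii).

“(ii) ⇒ (iii)”: It is easy to see that $\{\mathbb{E}^{\mathcal{P}_n}[\cdot]\}_{n=1}^{\infty}$ converges weakly to $\mathbb{E}^{\mathcal{P}}[\cdot]$, thus we only need to prove that (3.2) holds. Indeed, the function $f_K(\omega) := d(\omega_0,\omega)^p 1_{\{d(\omega_0,\omega) \ge K\}}$ is upper semi-continuous and thus can be approximated by $\varphi_m \downarrow f_K$ with $|\varphi_m(\omega)| \le C(1 + d(\omega_0,\omega)^p)$. Then we have

$$\limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[f_K] \le \limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi_m] = \mathbb{E}^{\mathcal{P}}[\varphi_m], \quad \forall m \in \mathbb{N}.$$

Letting $m \to \infty$,

$$\limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[f_K] \le \mathbb{E}^{\mathcal{P}}[f_K].$$

Since $\mathcal{P} \in \mathfrak{P}_p(\Omega)$, from Definition 2.4 (b), we conclude (3.2).

“(iii) ⇒ (ii)”: Given $\varphi \in C(\Omega)$ with the growth condition $|\varphi(\omega)| \le C(1 + d(\omega_0,\omega)^p)$, let

$$\varphi_K = \left( -C(1 + K^p) \vee \varphi \right) \wedge C(1 + K^p)$$

and $\psi_K = \varphi - \varphi_K$. Then,

$$\begin{aligned}
\left| \mathbb{E}^{\mathcal{P}_n}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi] \right|
\le{}& \left| \mathbb{E}^{\mathcal{P}_n}[\varphi_K] - \mathbb{E}^{\mathcal{P}}[\varphi_K] \right| + \mathbb{E}^{\mathcal{P}_n}[|\psi_K|] + \mathbb{E}^{\mathcal{P}}[|\psi_K|] \\
\le{}& \left| \mathbb{E}^{\mathcal{P}_n}[\varphi_K] - \mathbb{E}^{\mathcal{P}}[\varphi_K] \right| + C\, \mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] \\
&+ C\, \mathbb{E}^{\mathcal{P}}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right].
\end{aligned}$$

Therefore,

$$\begin{aligned}
\limsup_{n\to\infty} \left| \mathbb{E}^{\mathcal{P}_n}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi] \right|
\le{}& C \limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] \\
&+ C\, \mathbb{E}^{\mathcal{P}}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right].
\end{aligned}$$

Letting $K \to \infty$, the first term of the right-hand side goes to 0 by (3.2), and the second one goes to 0 since $\mathcal{P} \in \mathfrak{P}_p(\Omega)$. Thus (ii) holds.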


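Now, we prove “(i) ⇒ (iii)”. If $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$, then $\mathcal{W}_1(\mathcal{P}_n, \mathcal{P}) \to 0$. By Theorem 3.1, we have

$$\sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \left\{ \left| \mathbb{E}^{\mathcal{P}_n}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi] \right| \right\} \to 0,$$

which implies that $\lim_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi]$ holds for all Lipschitz functions. For each bounded continuous function $\varphi$, there exist sequences of bounded Lipschitz functions $\{\overline{\varphi}_k\}_{k=1}^{\infty}$ and $\{\underline{\varphi}_k\}_{k=1}^{\infty}$ such that

$$\overline{\varphi}_k \downarrow \varphi \quad \text{and} \quad \underline{\varphi}_k \uparrow \varphi.$$

Then, we can deduce

$$\limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi] \le \liminf_{k\to\infty} \limsup_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\overline{\varphi}_k] = \liminf_{k\to\infty} \mathbb{E}^{\mathcal{P}}[\overline{\varphi}_k] = \mathbb{E}^{\mathcal{P}}[\varphi].$$

Similarly, we have

$$\liminf_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\varphi] \ge \limsup_{k\to\infty} \liminf_{n\to\infty} \mathbb{E}^{\mathcal{P}_n}[\underline{\varphi}_k] = \limsup_{k\to\infty} \mathbb{E}^{\mathcal{P}}[\underline{\varphi}_k] = \mathbb{E}^{\mathcal{P}}[\varphi].$$

In conclusion, $\mathbb{E}^{\mathcal{P}_n}[\cdot]$ converges weakly to $\mathbb{E}^{\mathcal{P}}[\cdot]$.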


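It only remains to show that (3.2) holds. Indeed, one can first verify that for each $\omega, \omega' \in \Omega$,

$$d(\omega_0,\omega) 1_{\{d(\omega_0,\omega) > K\}} \le 2 d(\omega_0,\omega') 1_{\{d(\omega_0,\omega') > K/2\}} + 2 d(\omega,\omega'),$$

and immediately obtain

$$d(\omega_0,\omega)^p 1_{\{d(\omega_0,\omega) > K\}} \le 2^{2p-1} d(\omega_0,\omega')^p 1_{\{d(\omega_0,\omega') > K/2\}} + 2^{2p-1} d(\omega,\omega')^p.$$

Then we can deduce that for any $\mu_n \in \mathcal{P}_n$ and $\mu \in \mathcal{P}$,

$$E_{\mu_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] \le 2^{2p-1} E_{\mu}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right] + 2^{2p-1} W_p(\mu_n,\mu)^p,$$

which implies that

$$\mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] \le 2^{2p-1} \mathbb{E}^{\mathcal{P}}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K/2\}}(\cdot) \right] + 2^{2p-1} \mathcal{W}_p(\mathcal{P}_n,\mathcal{P})^p,$$

where the first term of the right-hand side goes to 0 as $K \to \infty$ since $\mathcal{P} \in \mathfrak{P}_p(\Omega)$, and the second one vanishes as $n \to \infty$ as a result of (i). Consequently, (3.2) holds.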


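Finally, we show (iii) ⇒ (i). Thanks to Proposition 3.3, we only need to prove the case when $p = 1$. By the definition of $\mathcal{W}_1$, it suffices to prove that

$$\sup_{\mu_n\in\mathcal{P}_n} \inf_{\mu\in\mathcal{P}} W_1(\mu_n,\mu) \to 0 \quad \text{and} \quad \sup_{\mu\in\mathcal{P}} \inf_{\mu_n\in\mathcal{P}_n} W_1(\mu_n,\mu) \to 0.$$

(a) Suppose $\sup_{\mu_n\in\mathcal{P}_n} \inf_{\mu\in\mathcal{P}} W_1(\mu_n,\mu) \not\to 0$; then there exist an $\varepsilon > 0$ and a subsequence $\{\mathcal{P}_{n_k}\}_{k=1}^{\infty} \subset \{\mathcal{P}_n\}_{n=1}^{\infty}$ such that for each $n_k$, there exists a $\mu_{n_k} \in \mathcal{P}_{n_k}$ with

$$\inf_{\mu\in\mathcal{P}} W_1(\mu_{n_k},\mu) \ge \varepsilon, \tag{3.3}$$

which implies that for all $\mu \in \mathcal{P}$, $W_1(\mu_{n_k},\mu) \ge \varepsilon$. By Proposition 3.2, the set $\bigcup_{n=1}^{\infty} \mathcal{P}_n$ is tight, so it is relatively weakly compact by Prokhorov's theorem. Then, there exists a subsequence of $\{\mu_{n_k}\}_{k=1}^{\infty}$, still denoted by $\{\mu_{n_k}\}_{k=1}^{\infty}$, which converges weakly to a probability measure $\mu^*$. In this case, we can apply the classical result for the Wasserstein distance to obtain $W_1(\mu_{n_k},\mu^*) \to 0$. On the other hand, for each $\varphi \in C_b(\Omega)$,

$$E_{\mu^*}[\varphi] = \lim_{k\to\infty} E_{\mu_{n_k}}[\varphi] \le \lim_{k\to\infty} \mathbb{E}^{\mathcal{P}_{n_k}}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi],$$

which implies from Corollary 2.8 that $\mu^* \in \mathcal{P}$. Therefore, we obtain immediately that

$$\inf_{\mu\in\mathcal{P}} W_1(\mu_{n_k},\mu) \to 0, \quad \text{as } k \to \infty,$$

which is in contradiction to (3.3).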
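(b) Suppose $\sup_{\mu\in\mathcal{P}} \inf_{\mu_n\in\mathcal{P}_n} W_1(\mu_n,\mu) \not\to 0$; then there exist an $\varepsilon > 0$, a sequence $\{\mu_k\}_{k=1}^{\infty} \subset \mathcal{P}$ and a subsequence $\{\mathcal{P}_{n_k}\}_{k=1}^{\infty} \subset \{\mathcal{P}_n\}_{n=1}^{\infty}$, such that

$$\inf_{\mu_{n_k}\in\mathcal{P}_{n_k}} W_1(\mu_{n_k},\mu_k) \ge 4\varepsilon.$$

Similarly to (a), one can find a subsequence of $\{\mu_k\}_{k=1}^{\infty}$, still denoted by $\{\mu_k\}_{k=1}^{\infty}$, which converges weakly to a probability measure $\mu^* \in \mathcal{P}$. For sufficiently large $k$,

$$\inf_{\mu_{n_k}\in\mathcal{P}_{n_k}} W_1(\mu_{n_k},\mu^*) \ge 3\varepsilon.$$

From the classical duality formula for the Kantorovich-Rubinstein distance (1.1), we have

$$\inf_{\mu_{n_k}\in\mathcal{P}_{n_k}} W_1(\mu_{n_k},\mu^*) = \inf_{\mu_{n_k}\in\mathcal{P}_{n_k}}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - E_{\mu_{n_k}}[\varphi]\}.$$

Since $\mathcal{P}_{n_k}$ and $\Phi := \{\varphi : \|\varphi\|_{\mathrm{Lip}} \le 1\}$ are convex and $\mathcal{P}_{n_k}$ is weakly compact, the minimax theorem implies

$$\begin{aligned}
\inf_{\mu_{n_k}\in\mathcal{P}_{n_k}}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - E_{\mu_{n_k}}[\varphi]\}
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \inf_{\mu_{n_k}\in\mathcal{P}_{n_k}} \{E_{\mu^*}[\varphi] - E_{\mu_{n_k}}[\varphi]\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_{\mu^*}[\varphi] - \mathbb{E}^{\mathcal{P}_{n_k}}[\varphi]\} \ge 3\varepsilon.
\end{aligned}$$

Therefore, thanks to (3.2), one can find a $\varphi \in C_b(\Omega)$ such that for sufficiently large $k$,

$$E_{\mu^*}[\varphi] \ge \mathbb{E}^{\mathcal{P}_{n_k}}[\varphi] + 2\varepsilon \ge \mathbb{E}^{\mathcal{P}}[\varphi] + \varepsilon,$$

which is in contradiction to $\mu^* \in \mathcal{P}$. □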

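Corollary 3.5. Let $p \ge 1$. Suppose that $\mathcal{P}_n$ and $\mathcal{P}$ are sets of probability measures on $(\Omega, d)$ satisfying all conditions in Definition 2.4 except the assumption of convexity. If $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$, then (ii) and (iii) in Theorem 3.4 still hold.

Proof. Let $\mathcal{P}_n^*$ and $\mathcal{P}^*$ be the convex hulls of $\mathcal{P}_n$ and $\mathcal{P}$, respectively. We first observe that $\mathbb{E}^{\mathcal{P}_n^*}[\varphi] = \mathbb{E}^{\mathcal{P}_n}[\varphi]$ and $\mathbb{E}^{\mathcal{P}^*}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi]$ for all $\varphi$, which implies that $\mathcal{P}_n^*, \mathcal{P}^* \in \mathfrak{P}_p(\Omega)$. As in the proof of the above theorem, we also have

$$\mathbb{E}^{\mathcal{P}_n^*}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right] = \mathbb{E}^{\mathcal{P}_n}\left[ d(\omega_0,\cdot)^p 1_{\{\omega:\ d(\omega_0,\omega) > K\}}(\cdot) \right],$$

thus $\{\mathcal{P}_n^*\}_{n=1}^{\infty}$ also satisfies (3.2). So it suffices to prove that $\mathcal{W}_1(\mathcal{P}_n^*, \mathcal{P}^*) \to 0$, which yields $\mathcal{W}_p(\mathcal{P}_n^*, \mathcal{P}^*) \to 0$ by Proposition 3.3.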

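Without the assumption of convexity on the sets of probability measures, we still have for each $n$,

$$\begin{aligned}
\sup_{\mu\in\mathcal{P}_n} \inf_{\nu\in\mathcal{P}} W_1(\mu,\nu)
&= \sup_{\mu\in\mathcal{P}_n}\ \inf_{\nu\in\mathcal{P}}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&\ge \sup_{\mu\in\mathcal{P}_n}\ \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \inf_{\nu\in\mathcal{P}} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1}\ \sup_{\mu\in\mathcal{P}_n}\ \inf_{\nu\in\mathcal{P}} \{E_\mu[\varphi] - E_\nu[\varphi]\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}_n}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi]\}.
\end{aligned}$$

Similarly,

$$\sup_{\nu\in\mathcal{P}} \inf_{\mu\in\mathcal{P}_n} W_1(\mu,\nu) \ge \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \{\mathbb{E}^{\mathcal{P}}[\varphi] - \mathbb{E}^{\mathcal{P}_n}[\varphi]\}.$$

Therefore,

$$\begin{aligned}
\mathcal{W}_1(\mathcal{P}_n,\mathcal{P})
&\ge \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \left\{ \left| \mathbb{E}^{\mathcal{P}_n}[\varphi] - \mathbb{E}^{\mathcal{P}}[\varphi] \right| \right\} \\
&= \sup_{\|\varphi\|_{\mathrm{Lip}}\le 1} \left\{ \left| \mathbb{E}^{\mathcal{P}_n^*}[\varphi] - \mathbb{E}^{\mathcal{P}^*}[\varphi] \right| \right\} = \mathcal{W}_1(\mathcal{P}_n^*, \mathcal{P}^*),
\end{aligned}$$

from which we deduce that $\mathcal{W}_1(\mathcal{P}_n^*, \mathcal{P}^*) \to 0$. By Theorem 3.4, (ii) and (iii) hold for $\mathbb{E}^{\mathcal{P}_n^*}[\cdot]$ and $\mathbb{E}^{\mathcal{P}^*}[\cdot]$, and hence for $\mathbb{E}^{\mathcal{P}_n}[\cdot]$ and $\mathbb{E}^{\mathcal{P}}[\cdot]$. □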

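Remark 3.6. In general, without the assumption of convexity, the statements (ii) and (iii) in Theorem 3.4 may not imply $\mathcal{W}_p(\mathcal{P}_n, \mathcal{P}) \to 0$. Here is a simple counterexample: consider a set $\mathcal{P}$ satisfying Definition 2.4 except for the convexity and such that $\mathcal{W}_1(\mathcal{P}, \mathcal{P}^*) > 0$, where $\mathcal{P}^*$ is the convex hull of $\mathcal{P}$. Let $\mathcal{P}_n = \mathcal{P}^*$ for each $n \in \mathbb{N}$. Obviously, (ii) and (iii) in Theorem 3.4 hold since $\mathbb{E}^{\mathcal{P}_n}[\varphi] = \mathbb{E}^{\mathcal{P}^*}[\varphi] = \mathbb{E}^{\mathcal{P}}[\varphi]$, but $\mathcal{W}_1(\mathcal{P}_n, \mathcal{P}) \not\to 0$.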
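A concrete instance of such a set can be explored numerically. The sketch below is an illustration under assumptions that are not in the paper: $\Omega = \{0, 1\}$ with $d(0,1) = 1$, the non-convex set is $\{\delta_0, \delta_1\}$, each measure is identified with its mass $t$ at the point 1 (so that $W_1$ between two such measures is $|s - t|$), and the convex hull is approximated by a grid of parameters. All names are hypothetical.

```python
def w1_two_point(s, t):
    """W_1 between (1-s)*delta_0 + s*delta_1 and (1-t)*delta_0 + t*delta_1."""
    return abs(s - t)


def hausdorff_w1(params_1, params_2):
    """Hausdorff-type distance of Definition 2.2 (p = 1) between two parameter lists."""
    side_12 = max(min(w1_two_point(s, t) for t in params_2) for s in params_1)
    side_21 = max(min(w1_two_point(s, t) for s in params_1) for t in params_2)
    return max(side_12, side_21)


def upper_expectation(params, f0, f1):
    """sup over the family of E[phi], where phi(0) = f0 and phi(1) = f1."""
    return max((1 - t) * f0 + t * f1 for t in params)


extreme = [0.0, 1.0]                   # the non-convex set {delta_0, delta_1}
hull = [k / 100 for k in range(101)]   # grid approximation of its convex hull

print(hausdorff_w1(extreme, hull))
print(upper_expectation(extreme, 2.0, -1.0), upper_expectation(hull, 2.0, -1.0))
```

The two families generate the same sublinear expectation, because a linear function of $t$ is maximised at an endpoint, yet their generalized distance stays bounded away from zero: the measure with $t = 1/2$ is at distance $1/2$ from both Dirac masses.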